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Abstract — Random finite sets (RFSs) lias been a fruitful area 
of researcii in recent years, yielding new approximate methods 
for multiple target tracking such as the probability hypothesis 
density (PHD), cardinalised PHD (CPHD), and multiple target 
multi-Bernoulli (MeMBer) filters. These new approaches have 
largely been based on approximations that side-step the need for 
measurement-to-track association in order to maintain tractabil- 
ity. In this paper, we derive a conjugate prior form for the 
full Bayes RFS filter under commonly invoked assumptions. 
We observe that data association is implicitly present in the 
form, and obtain alternative algorithms by approximating the 
discrete distribution of joint association events utilising a recently 
developed, high quality approximation of marginal association 
probabilities. The two resulting algorithms maintain a hybrid 
MeMBer/Poisson distribution of objects, but form Bernoulli 
components differently to the conventional MeMBer filter. The 
track-oriented marginal MeMBer/Poisson (TOM-MeMBer/P) is 
shown to be closely related to a variant of the popular joint 
integrated probabilistic data association (JIPDA) algorithm. The 
measurement oriented MeMBer/P (MOM-MeMer/P) adopts the 
structure of the MeMBer filter (collecting all hypotheses utilising 
a given measurement into a single multi-Bernoulli component), 
but uses marginal association probabilities to provide a significant 
improvement to performance in low probability of detection 
cases. We demonstrate both the performance and tractability 
of the filters in a challenging scenario involving multiple targets 
in close proximity. 



I. Introduction 

Recently, much work has been devoted to random finite set 
(RFS)-based approximations such as the probability hypoth- 
esis density (PHD) |1|, cardinalised PHD (CPHD) Q and 
multiple target multi-Bernoulli (MeMBer) filters fSl. A key 
characteristic of these methods has been their tractability, 
which is the consequence of clever approximations that avoid 
data association, i.e., reasoning over the correspondence of 
measurements and targets. For example, the PHD approxi- 
mates the posterior distribution as a Poisson point process 
(PPP), for which updates can be calculated without data 
association. The CPHD improves upon this by calculating the 
cardinality distribution, and using an independently, identically 
distributed (iid) cluster process approximation for which up- 
dates can be calculated efficiently; CPHD has been shown to 
improve performance significantly over PHD in challenging 
environments {e.g., |4|). The MeMBer makes approximations 
in its derivation that avoid association and are reasoned to 
be of little consequence when the false alarm rate is low; 
these approximations introduced a significant cardinality bias 
that was corrected in the cardinality balanced MeMBer (CB- 
MeMBer) |j5|. 

This paper is tJNCLASSIFIED and is approved for public release. 



Outside of the RFS field, two major approaches are joint 
probabilistic data association (JPDA) |6| and multiple hy- 
pothesis tracking (MHT) ]7|8) ; both explicitly formulate and 
reason over association hypotheses, i.e., hypotheses for the 
correspondence of measurements and targets. JPDA seeks to 
marginalise over the random variables representing association 
in order to calculate the marginal distribution of each track; 
joint integrated PDA (JIPDA) [9 lOJ and joint integrated track 
splitting (JITS) 1 10|11 1 extend this by incorporating target 
existence as an additional random variable to be estimated. 
MHT seeks to calculate the most likely association hypothesis, 
returning a tracking estimate conditioned on the hypothesis. 
Track oriented MHT (TOMHT) |12| provides a ti-actable 
approach for implementing MHT by maintaining a hypothesis 
tree for each track, and finding the most likely joint hypothesis 
indirectly (e.g., using Lagrangian relaxation ITS)). 

While the RFS framework avoids the need for explicitly 
modelling data association, it implicitly arises in the derivation 
of the full Bayes RFS filter This paper seeks to apply 
the high quality, tractable approximation of the marginal 
association probabilities 1 14|15| to tracking within the RFS 
framework, as an alternative to the approximations of PHD, 
CPHD or MeMBer Section [III] presents a derivation of the 
full Bayes RFS filter under commonly invoked assumptions, 
demonstrating a conjugate prior form for the filter (i.e., a 
form that is preserved by prediction and update); preliminary 
versions were available in the pre-prints 1 16|17| . The form 
incorporates a PPP component, and a second component that 
is structurally similar to TOMHT. In Section |IV[ we observe 
that data association implicitly arises in the derivation, and 
subsequently approximate the discrete distribution of data 
association hypotheses. The first approximation yields the 
track oriented marginal MeMBer/Poisson (TOM-MeMBer/P) 
filter, which is very similar to JITS and JIPDA; the differences 
relate to the inclusion of a Bayesian model of target birth in 
our RFS derivation. Thus, with minor changes, the popular 
JITS/JIPDA algorithms (and extensions thereof) can be derived 
within the RFS framework. The second approximation yields 
the measurement oriented marginal MeMBer/Poisson (MOM- 
MeMBer/P) filter, which collects all hypotheses using a given 
measurement into a single Bernoulli component in a manner 
similar to the MeMBer filter of p], but using the marginal as- 
sociation weights. In both cases, the marginal measurement-to- 
track association probabilities are approximated using the high 
quality, tractable procedure described in |I4|I5| . Section [Vl] 
demonstrates the performance of the proposed methods in a 
challenging scenario, showing that improved performance is 
obtained in comparison to CPHD and MeMBer (particularly 



in cases with a lower probability of detection) for a similar 
computational load. 

II. Background 

RFS -based methods Q have been developed in order to 
conduct statistical inference in problems in which the variables 
of interest and/or the observations form finite sets. The meth- 
ods are particularly applicable to tracking since they address 
two of the major challenges involved: 

• The number of targets present in the scene is unknown 

• Measurements are unordered, and measurement-to-target 
correspondence is unknown 

Problems such as this can be conveniently formulated by 
modelling the system state as a set of target stateaM Xt — 
{xj , . . . , a;"*} (at time t), and incorporating set-valued mea- 
surements Zt — {z}, . . . , z"^*}. RFS densities encode both 
uncertainty in the number of targets, and their states. A RFS 
density can be constructed from a cardinality distribution 
p{n), n > 0, and a series of cardinality-conditioned joint 
distributions fn{xi, ■ ■ ■ ,a;„), yielding 

f{{xi,...,Xn}) ^ P{n)^ fn{x.^(^i), . . . ,X,,(^n)) (1) 

7r 

where the sum over tt spans the n\ permutation functions, en- 
suring that f{X) is permutation invariant. Bayesian estimation 
can be performed in this framework (conceptually, at least) by 
interleaving prediction and update steps: |J3] p484] 



ft\t-iiX) 
ft\t{X) 



fiZt\X)ftit-i{X) 



(2) 



ft\t-i{Zt) 



«/(^t|X)/,|,_i(X) (3) 



where f^fi{X) is the RFS density at time t given measure- 
ments up to and including time t' ^ff.\f_i{X\X') is the RFS 
transition density, f{Z\X) is the RFS measurement likelihood, 
and the set integral is defined as: (3] p361] 

I g{X)5X ^ g{%) + ^ ^ I g{{xu ■ ■ .,Xn})dxi ■ --dxn 

•' n=l ^' •' 

(4) 
Probability generating functions (pgfs) are widely used in anal- 
ysis of queueing systems, etc, as they greatly simplify anal- 
ysis of independent random variables (e.g., [18]). Likewise, 
probability generating functionalaj (p.g.fls) simplify analysis 
of RFSs that possess independence relationships. The p.g.fl of 
a RFS density / is defined as: |3, p371] 



G[h] 



h^f{X)SX 



(5) 



where h = Yixex h{x). Just as the Fourier, Laplace and z 
transforms are alternative representations of signals, a p.g.fl is 

'We use the convention of capital letters (e.g., X) representing sets, and 
lower-case letters representing single members (e.g., x). 

-We leave off conditioning on the measurement set history Z* = 
(Zi, . . . , Zfi) throughout, as this is implicit in the second subscript /fij'. 

^Note that, whereas the argument of a pgf G{s) is the scalar s, the 
argument of a p.g.fl G[h] is the function h{x). 



an alternative representation of a RFS distribution. If X and 
Y are independent RFSs and Z = X U F, then [3, p372,386] 



fz{Z)^ J2fx{X)fY{Z-X) 



xcz 



(6) 



Gz[h]^Gx[h]-GY[h] 



This convolution identity is analogous to the result that the 
pgf of the sum of two independent variables z — x + y is 
G,is) = Gx{s) ■ Gy{s) |[T8)pl53]. 

This work utilises two basic forms of RFS distributions: 
PPPs and Bernoulli processes. A PPP with intensity function 
X{x) has density and p.g.fl: ||3] p373] 



/(X)=exp|- f X{x)dxY l[X{x) 

G[h] = cxp{X[h ~ 1]} oc exp{A[/i]} 
where A[/i] denotes the inner product of A(-) and ft,(-): 

X\h\^ f h{x)X{x)dx 



(7) 



(8) 



and thus X[h — 1] = J h{x)X{x)dx — J X{x)dx. 

A Bernoulli process with probability of existence r and 
existence-conditioned probability density function (PDF) f{x) 
has density and p.g.fl: 



1 - r, X = 

f{X)=lr-f{x), X^{x} 
[O, otherwise 

G[h] ^1-r + r- f[h] 



(9) 



By (|9]l and (J6]l, we can write the p.g.fl of a multi-Bernoulli 
process, i.e., the process resulting from the union Y = [JiXi 
of independent Bernoulli processes Xi: 



GY[h]^Y[{l- 



^^Mh]) 



(10) 



where r^ and fi{x) are the existence probability and existence- 
conditioned PDF of the i-th Bernoulli process Xi. 

Assumption [T] describes the dynamics model that we utilise 
in this work. 

Assumption 1. The multiple target state evolves according to 
the following time dynamics process: 

• Targets arrive at each time according to a non- 
homogeneous PPP with birth intensity A''(a;), indepen- 
dent of existing targets 

• Targets depart according to iid Markovian processes; the 
survival probability in state x is P^ (x) 

• Target motion follows iid Markovian processes; the 
single-target transition PDF is ft\t~i{x\x') 

Assumption [T] describes a multiple target dynamics process 
which is the union of a PPP (describing arrival of new targets), 
and an independent Bernoulli process for each existing target 
conditioning on the prior existence and state, G[/i|x'] = 1 — 



Thus, the multiple target transition p.g.fl is: |3, p474] 



G[h\X'] 



h"" ft^t-i{X\X')6X 



(11) 



= exp{A'^[/i-l]} n {l-P%x') + P%x')ph{x')} (12) 



x'ex' 



Taking the p.g.fl of (J2]l, changing the order of integration and 
substituting ( [T2| , we obtain the p.g.fl form of the prediction 
equation: ||3j p529] 

Gtit_i[h]= JJ h''ft\t_,{X\X')5Xft_^t-i{X')SX' (13) 



eM>^> - 1]} • Gi_i|i_i[l - P^ + P'ph 



(14) 
(15) 



In analogy to the similar result for pgfs 1 18 pi 50], the RFS 



In analogy with ( [T2| , the p.g.fl of the measurement likeli- 
hood f{Z\X) is gl p422] 

G[g\X]= j g^,f{Z\X)5Z (20) 

= cxp{Af^[g - 1]} \{{l P^ix) + P\x)pg{x)} (21) 

where Pg(x) = /5(z)/(z|a;)dz. Thus: |[3| p531] 

F[gM= j h''G[g\X]ft\t_^{X)5X (22) 

= cxp{Af^[g - 1]} j {h{l - Pd + P%)}^/,|,_i(X)JX 

(23) 
= expjAf'^L? - l]}G,|,_i[/i(l -P'' + P^pg)] (24) 

Thus, using ( [T5] l, ( [T9| ) and (|24]), a Bayes filter may be derived 
directly in p.g.fl form. A key component of the derivation is 
the product rule: ||3] p395] 



distribution can be recovered from the p.g.fl via the iterated 
functional derivative: (T p375-376,384] 
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xex 



Sx 



G[h] 



(16) 



h=0 



G[h+eS^]~G[h] 



where j^G[h] = limj^o '^'"^^°g'~'^^"' , and S^'ix) is a Dirac 
delta function concentrated at a; = x'QThus we can recover 
the p.g.fl of the updated distribution ^ via 13| p530] 



Gtit[h] (X / h''f{Zt\X)ft^t^,{X)SX 
S 

Jz, 

5 



r 11 h''g^'f{Z,\X)ft^t_,{X)SXSZt 
F[g,h] 



9=0 



(17) 
(18) 
(19) 



9=0 



where the derivative with respect to Zf operates on the 
corresponding functional g, and F[g,h] is the joint p.g.fl of 
measurements Zt (functional g) and targets X (functional h). 
Assumption|2]describes the measurement model that we utilise 
in this work. 

Assumption 2. The multiple target measurement process is 
as follows: 

• Each target may give rise to at most one measurement; 
probability of detection in state x is P'^{x) 

• Each measurement is the result of at most one target 

• False alarm measurements arrive according to a non- 
homogeneous PPP with intensity X^^{z), independent of 
targets and target- related measurements 

• Each target-derived measurement is independent of all 
other targets and measurements conditioned on its corre- 
sponding target; the single target measurement likelihood 
is f{z\x) 

'*A measure theoretic exploration of this can be found in |l9| Section 

2.2.5]. 



(25) 

where the notation 1+1 denotes that the sum is over all disjoint 
sets Wo,... ,Wn such that Wq U • • • U W„ = Z. This permits 
calculation of the derivative of a product of several component 
p.g.fls (e.g., a multi-Bernoulli p.g.fl). 

III. Random set filter derivation 

The model of interest was detailed in Assumptions [T] and 
[2] The derivation does not change if the birth density, sur- 
vival probability, detection probability, false alarm density and 
measurement likelihood are time varying (e.g., depending on 
a known, time-varying sensor state); we omit the time index 
from these parameters for notational simplicity. The multiple 
sensor case may be addressed by performing update steps for 
each sensor sequentially in between prediction steps. 

Our derivation is based on an induction process, assuming 
a particular form for the full multiple target probability distri- 
bution, and showing that this form is maintained by prediction 
and update steps. The form of the full multi-target distribution 
at time t, conditioned on measurements up to and including 
time t' < t, consists of two independent components: 



MAX)^ ^/ppp(r)/,i*-(x-y) 



YCX 



GtiAh] = G'^^,r[h]-G'^^r[h] 



(26) 



where fj^^^AX) (and the corresponding p.g.fl G^FJ'lh]) is a 



PPP representing unknown targetsrl/.e., those that have never 
been detected, with intensity X",^,{x): 



/rw«n^*v(^) 



xex 



(27) 



Gmii] <x eMK\Ah]} 



'Adopting terminology from J7J. 



and /t™, °(X) (and the coiTesponding p.g.fl C^^rih]) is a 
multi-Bernoulli mixture (MBM), i.e., a linear combination of 
multi-Bernoulli distributions: 



a.a ieTfit' 

a ieTtH' 



(28) 



where the sum over a spans all functions a : Tt\t' -^ 
{0, ...,n} such that {l,...,n} C a{Tt\t') and if a{i) > 
0, i y^ j then a{i) ^ a{j) (i.e., permutation-like functions 
that map exactly one track index onto each k £ {1, . . . , n}, 
and the remaining track indices onto 0), and 



X 



(i) 



0, a(i)=0 

{xa{i)}, a{i) > 



We refer to the multi-Bernoulli components in ( 28 i as tracks 



since they will tend to collect measurements corresponding to 
a single target (or Bernoulli tracks to emphasise the inclusion 
of a probability of existence). The set of tracks at time t using 
information up to i' < i is denoted by Tt\t' = {1, • ■ • , ?^t|i'}- 
While measurement-to-target association is not explicitly 
introduced in the derivation, we will find that it arises im- 



plicitly, and each term a in the sum in ( 28 1 turns out to 



correspond to a global association history hypothesis, i.e., 
one possible grouping of all past and present measurements 
into subsets, where each subset is hypothesised to correspond 
to a particular target. The source of the association events 
in the derivation is the sum over all decompositions of the 
measurements Zt into the disjoint subsets Wi assigned to 



prior tracks, which appears in the product rule of (25 i. As 



with TOMHT |12|, we decompose these global association 
hypotheses into single-target hypotheses, a — (a^, . . . , a"'i*'); 
each single-target hypothesis is re-used across many global 
hypotheses. As illustrated in Fig. [T] the structure incorporates 
a tree for each track, where the leaves of each tree correspond 
to the single-target hypotheses for that track. The set of single- 
target hypotheses for a given track i G Tt\t' is denoted 
'Hlu, ={!,..., /iju/}. Denoting the set of all past and present 
measurements indices bjrlA^* , each single-target hypothesis 
a* S T~L\u, comprises: 

• the history of measurement indice^ that are hypoth- 
esised to correspond to the target under the hypothe- 
sis, A^* (i,a*) ^ A^* {e.g., the measurement indices 
collected tracing from the root of the tree to the leaf 
corresponding to the hypothesis in Fig. [Til 

• the hypothesis weight w*;", 

• the hypothesis-conditioned Bernoulli multi-target distri- 
bution !l[^!{X) (with p.g.fl GJi^,'[/i]) of the form (9i, 
parameterised by the probability of existence under tne 



*The elements of M^ are of the form (t,j), where j G {1, ■ • ■ ,mt} 
is an index of a measurement in scan t < t' . 

''e.g., if M*'{i,a^) = {(1, 3), (2, 5), (4, 7)} at time t' = 4, then 
a* hypothesises that the target corresponding to track i is associated with 
measurement 3 at time 1, measurement 5 at time 2 and measurement 7 at 
time 4 (implying that a missed detection occurred at time 3). 



hypothesis r*;", , and the hypothesis- and existence 

conditioned PDF /^'° (x) 
The global hypothesis weights w° are related to the single 
target hypothesis weights via the expression 



w. 



t\t' 



(29) 



where a = (a\ . . . ,a"'i*'). As opposed to the TOMHT, 
each global hypothesis incorporates a single-target hypothesis 
for every track. Since the hypothesis-conditioned distribution 
f^!^, {X) is Bernoulli, the probability of existence under 
different single-target hypotheses will vary, thus through the 
multi-Bernoulli distribution, the fixed set of Uf^fi Bernoulli 
tracks can represent different numbers of targets. 



In (28 I, the sum over a spans all valid global association 
hypotheses, i.e., all hypotheses that associate each measure- 
ment in the history A^* with exactly one track. Ultimately, 
we will seek to avoid explicitly enumerating global association 
hypotheses. To permit this possibility, we replace this with 
a sum that spans the Cartesian product of the single-target 
hypothesis sets for each track, i.e.. 



nia'&ni,,} 



ae H ni^,,={{a',...^ 
The sum over this set can be made equivalent to ( [28] l by 

(30) 



modifying ( 29 1 to 



w (X et 



(«) n 



w. 



t\t' 



leTt,. 



where e^ (a) = 1 for any valid global association hypothesis, 
and et'{a) = otherwise, e.g.. 



et'{a) = 



0, 



1, 



3i^i' s.t. M''{i, a') r\M*'{i', 
or [J.^M*'{i,a')^M*' 
otherwise 



)^ 



(31) 



Note also that many of our expressions {e.g., (27 1, (30i) 
claim proportionality but not equality. This unambiguously 
characterises the posterior distribution through the knowledge 
that the density (and each of its independent components) must 
integrate to unity, or, equivalently, that Gf|(/[1] = 1. This is 
a simple matter for the PPP component, but it does not need 
to be performed since we do not extract estimates from this 
component (since outputting an estimate on a target that has 
never been detected would be nonsensical). Normalisation of 
the MBM component is achieved by ensuring that J^a ^" ~ 1' 
where w°- follows (30 1. In practice, this is accounted for within 



the approximate inference method outlined in Section IV-C 



A. Prediction step 

Theorem [T] derives the prediction step by substituting ( 26 1 
into ( 15 I. The theorem confirms what one would intuitively 
expect: that the prediction of the PPP component ( [32| follows 
the standard PHD prediction step of |1 20], and that the multi- 
Bernoulli tracks are predicted independently in an equivalent 
manner to the MeMBer 1 3|21) (excluding birth of new targets. 



since we utilise a PPP birth model as opposed to a multi- 
Bernoulli birth model). 

Theorem 1. Assume that the distribution from the previous 
time step Gt_i\t_i[h] is of the form given in t26\, [27), (2(Si, 
(50L Then the predicted distribution for the next step Gt\t~i [h] 



Time 1 (t\i'o measurements) 



is of the same form, with: 



AJ^,_i(a;) = X\x) + / ft\t^,{x\x')P%x')XUitM^')dx' 



w. 



H\t-1 
't\t~l 
^t\t-l 



H-l\t-li 



H\t-l 



K-i\t-i V * 



w. 



t-i|t- 



-^ V z, a* 



(32) 
(33) 

(34) 



' t-i|t-i 



^'(a;')/t-i|t_i(a;')dx'Vz,a^ (35) 



..^,, Ut\t-.{xW)nx')f-%_,{x'W ^ 

Tt\t-iix) = z — V I, a 



J p^{x')f:f,u,{x')dx' 



(36) 



The proof of the theorem is in Appendix [A] 

B. Measurement update step 

Theorem l2] derives the measurement update step by sub- 
stituting ([24| and (26 1 into (fT9]l. In Lemma [T] we prove the 
preliminary result that the posterior remains the union of an 
independent PPP and a second distribution which turns out to 
be a MBM. The PPP intensity update equation is ( [37| ; this is 
identical to a PHD update with no measurements |1|20| . The 
MBM update is illustrated and described in Fig. [l] 

Theorem 2. Assume that the predicted distribution Gt\t-i[h] 
is of the form given in ( |26p , ( |27p , pSl), pOl). Then the updated 
distribution Gt\t[h] (updated with the measurement set Zt — 
{z}, . . . , z™'}j is of the same form, with iiiu = riiu-i + '™t> 

Xt\,{x) ^ {I - P\x))\l,_,{x) (37) 

M' = M'-' U {{t,3)\j e {1, . . . ,mj} (38) 

For tracks continuing from previous time steps (i € 
{1, . . . , n(U_]^}j, a hypothesis is included for each combi- 
nation of a hypothesis from a previous time, and either a 
missed detection, or an update using one of the rrit new 
measurements, such that the number of hypotheses becomes 
hlu — ft.!u j^(l + JTJt). For missed detection hypotheses 



t\t t\t- 

(ie {l,...,nt\t-i}, a' G {1, ■ 



H\t~l 



})■• 



w. 



t\t 



' t\t 



w. 



t\t- 



-l(l 



' t\t- 



i.a ri.a r-, _ p, 

1 ^ ' t\t-lJ t\t-n^ ^ 



I. a 



't|t- 



-,nCt-A^-p'\ 



' t\t-i 



''t\t~ 



JlitA^-n 



ft\t (^) 






(39) 
(40) 

(41) 
(42) 



For hypotheses updating existing tracks (i £ {1, . . . , J^tit-i}, 



''t|t- 



_ii, a^G{l,-. 



, ft.J|t_J, j e {!,..., mt}, i.e.. 



Track 2 (new) 



Track 1 (new) 



I.e. (1,1) n.e. (12) 

Time 2 (two measurements) 

Track 1 Track 2 

.(1,1) /\(1,2) 



n.e. = non-existent 

m = missed detection 

(t,j) = time t measurement _/' 



Track 3 (new) 




n.e. m (2.1) (2,2) n.e. m (2.1) (22) n.e. 

t 1 L 



(2,1) n.e. 



Track 4 (new) 



(2.2) 



example of a valid global hypothesis 

Fig. 1 . MBM .structure maintained by filter. Structure after time 1 i.s shown at 
the top (assuming that there were two measurements). A new track is created 
for each measurement; this results from updating the PPP of unknown targets 
with each measurement. The new tracks each contain two hypotheses; one 
hypothesising that the measurement goes with a previously existing track and 
hence that the new track is not required (following eq p8l), and the other 
hypothesising that the measurement goes with the new track, capturing both 
the possibility that it is the result of a target detected for the first time, or 
a false alarm (following eq J49j-J52l). The structure after time 2 is shown 
at the bottom (assuming that there were also two measurements in the new 
scan). Each track from the prior distribution is continued, incorporating a 
hypothesis for each prior hypothesis (marked with "m", coiresponding to a 
missed detection, following eq J39}-J42}), and for each combination of a prior 
hypothesis and a new measurement (marked with "(i,i)", corresponding to 
the prior hypothesis being updated with the measurement, following eq J43l- 
J46l). Non-existence hypotheses are continued without branching. A branching 
structure similar to TOMHT results. New tracks are created for each new 
measurement (marked with "new"), following the structure described for the 
first time step; again, these con'espond to updating the PPP of unknown targets 
with each new measurement. The gray line shows an example of a global 
hypothesis-/, e., a choice of one single-target hypothesis from the tree for 
each track, in which every measurement in every scan is used exactly once. 
Each term in the sum in J28j corresponds to a global hypothesis. 

the previous hypothesis a*, updated with measurement zi)y\ 
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Finally, for new tracks, i € {jT-tif-i + j}, j € {1, . • . ,mt} 
(i.e., the new track commencing on measurement zj), 

K\t = 2 (47) 

X*(i,l) = 0, will ^ I, rl^l=0 (48) 

M'{t,2)^{{t,j)} (49) 

wlf^^X'^izi) + \l_,[fizi\-)P''] (50) 
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(51) 



(52) 



A hypothesis at the previous time with '"^L, = need not be updated 
since the posterior weight in j44^ would be zero. For simplicity, the hypothesis 
numbering does not account for this exclusion. 



Note that we have not specified the distribution f^L (x) 
for new target tracks. Since r^L — {i.e., the probability of 
existence is zero), the value of this distribution has no effect. 
In Lemma [T] we prove the first piece of Theorem l2j i.e., the 
structural result that Gf|t[/i] remains a union of an independent 
PPP and some other distribution (which will subsequently be 
shown to be a MBM). 



Lemma 1. The updated distribution Gt\t[h] has the form 



where 



G,|,[/.] = GPf[/.] 



GPfMcxexplA^J/.]} 
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Ginbii 
t\t 
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t\t l"-J ^ ^^i'\^H\tV 
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(53) 



(54) 



(55) 



3=0 



and A^j(a;) is given in \37\. 



The proof of Lemma [T] is in Appendix IS] Lemma [T] shows 
that, in the updated distribution, the portion of the component 
^t'u- 1 1'^] '■^^'- ^^ '^°'- detected is independent (in the posterior) 
of the remainder of the distribution (which comprises the 
portion of G['|j'^j^[/i] that is detected, and both detected and 
undetected portions of the component G"^'^_"5^[/i]). Note that 
this is not claiming that the entire undetected portion is 
independent of the detected portion. The proof of Theoreml2]is 
in Appendix [B] The updates in (|42]), ( |46| and ( [52| ) are standard 
single target measurement updates, and thus can be calcu- 
lated with well-known methods {e.g., |22|). The derivation 
incorporates hypotheses updating every prior hypothesis with 
every measurement; however, in practical implementations, 
gating can be used to reduce the computational burden by 
excluding hypotheses with negligible weights. Other standard 
approximations such as clustering, mixture reduction, etc are 
also required in practice. 



C. Initialisation 

One significant benefit of the inclusion of a Poisson com- 
ponent is in initialisation of the tracker. Generally, when 
a surveillance system is first activated, the fact that prior 
measurements are not available does not imply that no targets 
are present. The Poisson distribution provides a convenient 
mechanism for specifying a prior distribution on the number 
and position of targets when little information is available. 
By initialising ?t.o|o = {i.e., no MBM components), and 
setting AQ|g(x) — Ao/o(a;) where Ap is the expected number 
of targets present, and fo{x) is the (presumably diffuse) 
prior state distribution, the Bayes prediction and update steps 
incorporate this knowledge, adjusting the accrual of confidence 
in the existence probabilities r^l" accordingly. In [23|, it was 
demonstrated that this approach can considerably improve the 
speed of track initiation in low signal-to-noise ratio (SNR) 
environments. 



IV. Marginal association filters 

The previous section derived a conjugate prior form for the 
full Bayes RFS filter. Not surprisingly, the filter is intractable, 
and approximations must be made. Noting its similarity to the 
PHD, the PPP component of the posterior is tractable {e.g., see 
fSSll). The difficulty is the number of global association history 
hypotheses, each of which gives rise to a term in the sum in 



the MBM (28 I. To motivate our approximation, consider the 
definition of a probability of joint association history: 



Pt\t{a) ^ w'' ^ et{a) J] 



"t\t 



(56) 
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following which ( 28 1 can be equivalently written as a total 
probability expansion 



/*if"w-E^*it(«) n fi 



I. a 
t\t 
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(57) 



ieTtu 



The form ( [28] l was chosen to elucidate this equivalence, 
which in turn suggests methods that make approximations to 
the discrete distribution Pm{a). The methods in this section 
approximate the posterior distribution f"^^'^{X) by directly 
approximating the discrete probability distribution of global 
association history hypotheses Pj|((a). Herein we will assume 
that the prior distribution f^{fl_i{X) is multi-Bernoulli, such 
that the predicted distribution f^^^{X) is multi-Bernoulli, 
and our task is to gain an approximation of f^^'^'^{X) that is 
multi-Bernoulli (since the true distribution is not). A distri- 
bution in the form ( |28] l will be multi-Bernoulli if there is a 
single association hypothesis, e.g.. 






t\t 

't\t 



1 = 1 Vi, 
lVi,a* 



(58) 



In this case, we refer to the weight, existence probability and 
PDF for track i as wjij — 1, r^^ and flu{x) respectively. 

A. Track-oriented marginal MeMBer-Poisson filter 

Suppose we seek an approximate representation of the dis- 
tribution Pt|f(a) in which the hypotheses for each track a' 
comprising the global hypothesis a = (a^, . . . , a"'i*) are 
forced to be independent, i.e., a distribution of the form 

Pt\tia) « n '^'("') (59) 

i£Tt\t 

Then with no further approximation, the joint multi-target 
distribution in ( [ST] ) becomes: 

(60) 

fl^^(x)= — - -~^^ .^-^ 



E I'i^iftifix) 



which is multi-Bernoulli. The problem of finding the choice 
of q^{a^) which best fits the assumed form of the distribu- 
tion can be solved by minimising the Kullback-Leibler (KL) 
divergence: 



[«"(«')] 



ieTtu 



arg min D 

g»(a*),iGTt|t 



Pt\tia) 



n 
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(61) 



where D{p\\q) = Ea-P(a)log^^ 



q{a) 



It is shown in |24 



p277] 



that gr"(aO = Pt*|t(a*) = Ea|a'=a' ^t|t(a), /.e., the marginal 
association distribution for track i. 

Accordingly, the data carried forward to the next time 
for the MBM component are simply the single hypothesis 
Bernoulli distributions f\^{X) (parameterised by an existence 
probability, and a position distribution consisting of a weighted 
mixture of the old hypothesis-conditioned distributions). We 
refer to this as the track oriented marginal MeMBer/Poisson 
filter (TOM-MeMBer/P), as it forms tracks out of the marginal 
track-to-measurement association distributions. The approxi- 
mation made in the TOM-MeMBer/P is exactly that made in 
JPDA: approximating thejoint association distribution by the 
product of its marginals|j Whereas JPDA assumes target ex- 
istence, applying the marginal approximation within the RFS 
framework yields an algorithm that naturally captures target 
existence. In what follows, we show that the TOM-MeMBer/P 
is equivalent to a particular variant of JITS/JIPDAP' with one 
minor modification. The specific variant is JITS/JIPDA with 
a parametric clutter model, and using the one-point track 
initialisation method incorporating a new object spatial density 
|[l0] p324]. We assume that P^{x) = P^ and P''{x) = P"" {i.e., 
that both of these parameters are spatially constant). From 



(56 1, (40 1, (44 1, (50 1 and (58 i, the probability of a global 



association hypothesis a is: 
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Since, under any hypothesis, a measurement will either be 
explained as belonging to an existing track (3 i s.t. a* — j), or 
a new track (a"'i*-i+^ = 2) (which incorporates the possibility 



of it being a false alarm), we can divide ( 62 1 by the constant 
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(63) 



which differs from pO] pl79] only in the inclusion of the 
term P'^X^,^__-^[f{zf |-)] in the denominator. Thus it is seen 
that JITS/JIPDA neglects the influence of unknown targets 
in the calculation of probabilities of association events. From 

' JPDA/JIPDA additionally approximates the posterior distribution of each 
target as being Gaussian. 



p^ , ( [5T| and (|60]l, the posterior probability of existence of 
I track started oi 
j e {!,..., rnj) 



a track started on a measurement Zf is (i — nt\t-i + J 
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(64) 



This is exactly 1 10 
P^X'tu .\fiz\-)]. Thus, 



p325, (9.13)], with Aaiget(2) = 

track initiation in the variant of 



t|t-i[/(^. .. 

JITS/JIPDA modelling a birth intensity ptarget(z) can be seen 
to be equivalent to TOM-MeMBer/P assuming a stationary 
intensity of unknown targetsPj The benefit of dynamically 
estimating this time-varying quantity was demonstrated in 
123 1 . Accordingly, we have shown that after two minor mod- 
ifications and the approximation of ( [60| , JITS/JIPDA may be 
derived (and extended) using the RFS formalism. Similar al- 
gorithms have also been proposed within the RFS community; 
for example, |25| updates existing tracks in a similar manner, 
but uses a two-point differencing approach to calculate initial 
states, weighting the new target hypotheses using a constant 
density of new targets (motivated as a truncated Poisson); and 
1 26 1 considers a bank of joint target-detection and tracking 
(JoTT) filters interacting through modified weights. 



B. Measurement oriented marginal MeMBer-Poisson filter 

The most striking difference between conventional filters 
and the MeMBer filter | [3l5| is that the MeMBer collects 
aU hypotheses corresponding to a particular measurement in 
the most recent scan into one track {i.e., multi-Bernoulli 
component), splitting hypotheses from prior tracks among 
the new tracks. This is the opposite of the TOM-MeMBer/P 
(and most conventional trackers), which favours continuity 
of historical components over the constraints relating to the 
most recent measurement scan. An alternative MeMBer fil- 
ter can be obtained by using a different marginalisation of 
(62) Pt\t{o) that follows this philosophy. We refer to the result as 



t\t 

the measurement oriented marginal MeMBer/Poisson (MOM- 
MeMBer/P) filter. For consistency, we continue to use the term 
track to refer to multi-Bernoulli components although, unlike 
the TOM-MeMBer/P, there is no correspondence of tracks 
between time steps. 

The MOM-MeMBer/P is based on an alternative param- 
eterisation of association hypothesesFH Rather than the pa- 
rameterisation a = (a^, . . . , a"'i*) where a* indexes the mea- 
surement(s) associated with track i, we formulate hypotheses 
as 6 = (6^,..., 6™') where V indexes the track associated 
with measurement j, e.g., IP = i indicates that measurement 
j is associated with track i. Under the assumption in (58i, 



b is an equivalent parameterisation of the global association 
hypotheses. However, it gives rise to an alternative approx- 
imation of the distribution. Specifically, rather than seeking 
the best-fitting distribution that enforces independence in the 
association variables for different tracks (i.e., ([M)), we seek 

'"We also incorporate the kinematic distribution of targets into the initial 
update in j52|; this distribution is assumed to be uniform in 1101. 

"For clarity of explanation, we initially omit missed detections, new 
targets and false alai'ms. 
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Fig. 2. Scenario involving two targets in one dimension. Prior multi-Bernoulli 
components are shown in (a) (solid is i = 1, dashed is i = 2), along with 
measurements ("x"; left is j = 1, right is j = 2), The two-target joint prior 
distribution is shown in (b). Result of updating each component with each 
measurement is shown in (c) (blue is j = 1, green is j = 2). Joint two- 
target posterior is shown in (d). MeMBer approximations earned forward by 
TOM-MeMBer/P and MOM-MeMBer/P are shown in (e) and (f) respectively. 



the best-fitting distribution that enforces independence in the 
association variables for different measurements: 



Pt\t{b)^l[q'*{b' 



W*it^)]^ 



arg mmD[ Pt\t{h) 

ijJ(6J) 



\{q'{h^ 



(65) 



where, again following |24 p277], q^*{V) = PL{V) 



Eb|63=bJ Pt\t{b) is the marginal probability that measurement 
j is associated with track i. 

As an example, consider the case in Fig. |2] involving two 
targets and two measurements, neglecting the possibilities of 
new targets or missed detections. The prior distribution for 
each Bernoulli component is illustrated in (a). The correspond- 
ing two-target joint distribution is shown in (b); this is the 
symmetrisation of the distribution f^{x])p{x'f) (leaving off 
conditioning on prior measurements for simplicity), i.e.. 

The hypothesis-conditioned updated distributions for each 
prior marginal, updated with each measurement are shown 
in (c), weighted by the association likelihood w*-' — 



j f{zl\x)f^{x)dx. The exact posterior for two targets is 
shown in (d); this is the symmetrised version of the distribution 

w'^'w''^f{x\\zl)f{xl\z',)+w'^^w'^'f\x\\z^,)f{xl\zl) 

The posterior approximation used by the TOM-MeMBer/P 
(and JIPDA, etc) is shown in (e); this is a symmetrised version 
of the distribution 

[p'^'f\x\\zl)+p'-'f'{x\\zl)\- 

'f{xl\zl)+p'''f{A4)] 



where p^'^ — P^uia^ — (^ij)) = Pt\t^^ = *) i^ '■^^ marginal 
probability that measurement j is associated with track i. The 
previous section showed this to be the best approximation in 
which the association variables for different tracks are forced 



to be independent. In ( 67 1, we will see that the approximation 



obtained by the measurement-oriented marginalisation in ( 65 1 
is a symmetrisation of 

[p'^'f{x\\zi)+p--'nx\\zi)\- 

[p'''f{xl\z^,)+p'''f{xl\z^,)] 

as illustrated in (f). In this instance, the MOM-MeMBer/P 
approximation clearly provides a better representation of the 
true distribution. 

Before proceeding, we expand our notation to admit new 
targets, false alarms and missed detections. Since we have 



assumed that the prior distribution was multi -Bernoulli (58 1, 
posterior hypotheses will be either A^ *(«,«*) = (the target 
was missed) or A^*(i,a*) = {(i,j)} (measurement zl corre- 
sponds to target i). This set of hypotheses can be characterised 
equivalently via a set of hypotheses h = (6^, . . . , &"*«+"« |t-i) 
where for j e {1, . . . ,mt], W = if measurement j is 
hypothesised to not correspond to a previously detected target, 
and V — i\f measurement j corresponds to pre-existing target 
i, and for j = m^ + i, i G {1, . . . , nt\t-i}, 6™*+* e {0, z}, 
where 6™*+* = i if track i is missed (and otherwise). The 
global hypotheses a and b are alternative parameterisations of 



the same event. Thus ( 57 1 can be written equivalently as 
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where, in analogy with ( [3T| , 

fO, 3 jV j' s.t. 6J = &J' > 
e„(6) = <! or 3 i e {1, . . . , 7i} s.t. W ^ i\/ j 
[l, otherwise 

and I3{b' ,j) maps to the hypothesis and track corresponding 

to y-. 

, if V = i e {l,...,nt\t-i} for j e {1,._^^, mj then 



P{V,j) maps to the hypothesis in (43i-(46i updating 
track i with measurement j 

if V — for j G {!,..., nit} then f3{b^ ,j) maps to the 
new target hypothesis in (|49ll-([52)i 



• if V = i for j ~ rrit+i then /3(&^ , j) maps to the missed 
detection hypothesis in (|39|-(|42]) 

• if fr' = for j — nit + i then f3{V ,j) maps to the "non- 
existence" hypothesis, i.e., (J48]l 

Subsequently, we obtain an alternative approximation based on 
the representation of Pt\t{b) via the product of its marginals: 
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Again, with no further approximation, the joint distribution is 
multi-Bernoulli: 
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We have thus obtained an alternative form of update for a 
filter that is structurally similar to the MeMBer |3 5|. The only 
approximation made in the derivation is the approximation of 
the joint association distribution Pt\t{b) as the product of its 
marginals p^ 

The difference between the tracks formed by the TOM- 
MeMBer/P and MOM-MeMBer/P is illustrated in Fig. |3] 
In the TOM-MeMBer/P, there is a track in the posterior 
corresponding to each prior track collecting all hypotheses 
updating that track, and a track in the posterior for each 
measurement considering the possibility of the measurement 
being a new target. In the MOM-MeMBer/P, there is a track 
for each measurement collecting all hypotheses updated by 
the measurement (collecting hypotheses from all prior tracks), 
and a track for each prior track containing only the "missed 
detection" hypothesis. 

Detailed pseudocode for both TOM-MeMBer/P and MoM- 
MeMBer/P can be found in Appendix |C] 

C. Approximating marginal association distributions 

The greatest barrier to practical application of the TOM- 
MeMBer and MOM-MeMBer algorithms is the calculation of 
the marginal probabilities of association PJ:,^{a'') and Pl^{V), 
which is closely related to the #P-complete calculation of a 
matrix permanent. The present work was motivated by the 
emergence of a new approximation to this calculation that is 
both accurate and tractable |14|. The approximation is based 
on loopy belief propagation (LBP), and the particular model 
used has been studied recently in machine learning, physics 
and information theoryPj While LBP was discovered as a 
heuristic extension of the standard forwards-backwards algo- 
rithm for Markov chains to probabilistic graphs with loops, it 
has since been understood as a variational {i.e., optimisation- 
based) inference method with roots in statistical physics LZTJ. 

'-Clearly, practical implementations will require additional approximations 
in pruning unlikely tracks, reducing the number of components in the 
mixture-conditioned distributions, and in calculating the marginal distributions 
P/i (b-') themselves. 
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Fig. 3. Example of marginal distribution of track-measurement association, 
and Bernoulli components formed by TOM-MeMBer/P (collecting columns) 
and MOM-MeMBer/P (collecting rows). Each cell of the table contains 
the marginal probability that a given measurement is associated with a 
given track (which sums over the same global association hypotheses as 
the probability that the track is associated with the measurement). In the 
example, there are two continuing tracks and three new measurements; this 
results in five posterior tracks (i.e., Bernoulli components). Using the TOM- 
MeMBer/P, these correspond to the two existing tracks, and a new track 
for each measurement. Using the MOM-MeMBer/P, a posterior track exists 
for each measurement, collecting events from all prior tracks that use the 
measurement. The remaining posterior tracks hypothesise missed detection of 
the corresponding prior track (and non-existence otherwise). 



While there are few cases in which the performance of LBP 
is guaranteed, its practical performance in many problems is 
remarkable; for example, turbo decoding has been shown to 
be an instance of LBP | [28| . While LBP is not guaranteed to 
converge in general, its convergence in the model of interest 
is proven in |14|, and the underlying optimisation problem is 
shown to be convex in p3] . Details of the algorithm, the accu- 
racy of the marginal estimates that it yields and the number of 
iterations required for convergence are studied in detail in [14J ; 
detailed pseudocode for the algorithm and its employment in 
the TOM-MeMBer/P and MOM-MeMBer/P can be found in 
Appendix IC] Note that the filter derivation above is not tied to 
the LBP approximation; any other method such as Maskell's 
exact EHM |29|, MCMCDA f30l, or LMIPDA flOTI | may 
also be used to calculate (approximately or otherwise) the 
marginal association probabilities. 

V. Contributions and relationship to other work 

The relationships of the proposed algorithms to the 
JITS/JIPDA and MeMBer filters were discussed in Sections 
|IV-A| and |IV-B| As discussed in Section |lll) the structure of 
the MBM component of the distribution is similar to that of 
the TOMHT (in its use of a hypothesis tree for each target, 
implicitly representing global association hypotheses). At each 
time, MHT methods seek the most probable global association 
hypothesis, which corresponds to the term in the sum ( 28 1 



with the largest weight. The derivation in Section nU] retains 



t\t^ 



See 14 15 for references. 



separability of the weights (30i such that efficient methods 
such as Lagrangian relaxation could be applied to obtain a 
RFS-based TOMHT; these were applied in 1 16|32|. It has been 



10 



suggested that the exphcit modelUng of associations in MHT 
may induce a bias into the solution (e.g., |3, p340]). While 



the derivation in Section III showed that association naturally 
arises in RFS filters, it appears as a total probability expansion 
([28]l. Thus the derivation does not validate approaches such as 



MHT that replace the sum over a in ( 28 1 with a maximisation. 
The derivation is related to | 33|34| , which was developed 
simultaneously and independently. On a theoretical level, the 
primary difference is that the latter papers study the labelled 
case (rather than the unlabelled case studied herein). The 
labelled case has obvious benefits in its inherent ability to 
maintain continuity between time steps. On a practical level, 
both methods effectively involve a total probability expansion 
over an intractable number of terms (i.e., global association 
hypotheses). The labelling in |33| increases the number of 
terms, since unknown targets from the Poisson birth process 
must be approximated via additional components in the sum. 
The filter in |33| operates by carrying a fixed number (N) 
of terms from one time step to the next. The update step then 
applies Murty's algorithm N times to find the N terms with the 
highest weight, truncating the sum to contain only these terms. 
This is algorithmically similar to the MHT implementation in 
p5|. While this approach avoids effects such as coalescence, it 
limits the number of global association hypotheses that can be 
represented. For example, if our marginal filter approach main- 
tains twenty tracks, each with ten single-target hypotheses, 
collectively 10^" global hypotheses are implicitly represented 



(with the error in weights induced by the approximation (59i); 
explicit representation of this number of terms is obviously 
intractable. This can be expected to limit the performance of 
the approach in |33| in low SNR environments (i.e., low P^, 
high Af^). 

The concept of maintaining a representation of targets that 
have never been detected is uncommon but not new; e.g., 
see |[8|. While this aspect of the filter has not been a focus 
in this paper, it yields significant benefits including in the 
speed of initiation during the initial transient when the tracker 
commences and in environments with non-stationary sensor 
coverage |23|. Furthermore, it is shown in |23| that multi- 
Bernoulli tracks with a low probability of existence can be 
efficiently represented via the Poisson component, greatly 
reducing the complexity associated with initiating tracks in 
high false alarm environments. The adaptive clutter estimation 
approach of |36| may also be incorporated tractably via the 
Poisson component. 

VI. Results 
We demonstrate the performance of the proposed ap- 
proaches in a challenging scenarios involving n E {6, 10, 20} 
targets which are in close proximity at the mid-point of 
the simulation, achieved by initialising at the mid-point and 
running forward and backward dynamics. We consider two 
cases for the mid-point initialisation (i.e., t ~ 100): 

Case 1: xioo ^ A/'{0, 10~^ x I4X4} 
Case 2: xmo ^ ^f{0, 0-25 x I4X4} 

where the target state is position and velocity in two di- 
mensions. Snapshots of one dimension of both cases are 



shown in Fig. |4] It is well-known that JPDA and related 
algorithms suffer from coalescence, i.e., after targets come in 
close proximity, mean estimates remain on the mid-point of 
the targets for a significant duration (e.g., p7[). In case 1, tar- 
gets are completely indistinguishable at the mid-point, hence 
coalescence-like effects are at their worst. Coalescence-like 
effects also occur in case 2, but to a lesser extent (mainly due 
to the discernible difference in velocity). In case 1, targets all 
exist throughout the simulation (tracks are not pre-initialised). 
In case 2, the targets are born at times {0, 10, ... , 10(?i — 1)} 
(any targets not existing prior to time t = 100 are born at that 
time; consequently, for case 2 with n — 20, ten targets are 
born at time t = 100). Targets follow a hnear-Gaussian model 
with nominally constant velocity, Xt — Fxt_i + Wt, where 
u;t-AA{0,Q}, 
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and q = 0.01, T = 1. Target-originated measurements provide 
position corrupted by Gaussian noise, i.e., zt = Hxf + vt, 
where H = [ 1 ] (g) 12x2. and Vt ~ A/'{0,l2x2}- The 
initial unknown target intensity for the proposed methods 
is assumed to be AQ|g(.T) = 10A/'{x; 0, P}, where P = 
diag[100^, 1, 100^, 1] covers the position and velocity 
region of interest. The birth intensity uses the same covariance, 
A''(a;) = O.OSA/'jx; 0, P}. Cases are considered with the 
average number of false alarms per scan is A''' G {10,80}, 
and their distribution is uniform on [—100,100]^. All fil- 



ters assume P'^ 



0.999. We consider cases with P e 



{0.3, 0.5, 0.7, 0.98}, representing a range of SNR values. 

We compare the proposed methods to CPHD |3 4|, CB- 
MeMBer g, JITS and Unear multi-target ITS (LMITS) 
1 10|11 1 (a variant of JITS which replaces the marginal asso- 
ciation probabilities with a heuristic approximation). The CB- 
MeMBer approximates the initial distribution by 30 identical 
Bernoulli components with r ~ 1/3, and uses a single birth 
component with r = 0.05 at each time. JITS and LMITS 
truncate the unknown target intensity to use ptarget(^) = 
P'^^UiA')]- The implementation of JITS initiates a new 



track on every measurement (via ( 64 1), but only updates new 
tracks with measurements that are not in the gate of any 
hypothesis of an existing track. Exact marginal association 
probabilities were calculated using the junction tree algorithm 
described in 1 14:|, which in turn uses the library for discrete ap- 
proximate inference p8) . The exact calculation is not possible 
for the simulations with n > 6 targets, or with A^** > 10. Since 
the weights provided by LMITS do not provide a probability 
that a measurement is not used by any other track, the LMITS 
implementation initiates tracks only on measurements that are 
not within the gate of any hypothesis. Where applicable, all 
tracks with r^V > 10^"* are maintained (the threshold needed 
to be increased to 10^'^ for JITS to be able to successfully ex- 
ecute). All methods utilise Gaussian mixture implementations, 
with hypothesis management performed using n-scan merging 
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20 40 60 



100 120 140 160 ISO 200 
Time 



Fig. 4. One dimension of a single Monte Carlo run of the scenario cases 
1 and 2. Target trajectories shown in colours, and measurements shown in 
gray. Both use P'^ = 0.7; top and bottom have the expected number of false 
alarms set to 80 and 10 respectively. 

||39^|, with n = SJ^All methods tested share the same fihering 
and hypothesis management code, so run times are somewhat 
comparable. All code was written in MATLAB except for the 
C++ junction tree routine used for JITS |3F|. Clustering is 
used only in the JITS marginal probability calculation. The 
TOM-MeMBer/P, JITS and LMITS implementations output 
estimates^ for all tracks with existence probability > 0.8, 
while the remaining methods find the mode of the cardinality 
distribution (excluding the Poisson component for the MOM- 
MeMBer/P), and output estimates^ for the corresponding 
number of tracks with the highest probability of existence 
(or, in the case of CPHD, the Gaussian components with 
highest weight). The results are shown in Fig. l5] the figure 
shows the mean optimal sub-pattern assignment (OSPA) [401 
with p = 2 and c = 10 (calculated over both position and 
velocity) as a function of time, averaged over 200 Monte Carlo 
trials. Average run times are shown in Table |l] The following 
observations are made: 

• The effect of coalescence is clearly visible from t — 100 
to the end of the simulation. JITS, TOM-MeMBer/P 
and LMITS are affected similarly, but to varying ex- 
tents. Interestingly, LMITS is affected less than TOM- 
MeMBer/P, which is affected less than JITS. The differ- 
ence in each of these is the approximation of the marginal 
association probabilities used — the approximation used 
by LMITS is shown in y4j| to be inferior to the LBP- 
based method used for the TOM-MeMBer/P, whereas 
the JITS uses the exact values. Counterintuitively, the 
poorer approximations perform better. This suggests that 
the approximations tend to be less equivocal between 
the competing hypotheses than the exact probabilities. 

'"* Histories are only considered to be identical if the last three measurement 
updates are the same — i.e., missed detection events are not considered to be 
an element in the history. A maximum of 2000 hypotheses are retained (in 
total), and hypotheses with w^'.'f r'|" < 10~^ are deleted. 

"The estimate is the mean of the Gaussian component with the highest 
weight. 



TABLE I 

Average execution time for a complete MC trial (in seconds) 
FOR TOM-MeMBer/P, MOM-MeMBer/P, jits, LMITS, CPHD AND 

CB-MEMBER in various SCENARIOS. FIGURES IN PARENTHESES 

INDICATE THE NUMBER OF MC TRIALS THAT FAILED (FOR JITS) DUE TO 

MEMORY RESTRICTIONS. 



Case, n 


P^ A*- 


TOM- 


MOM- 


JITS 


LM- 


CPHD 


CB- 


1, 6 


0.98, 10 


9 


10 


7(6) 


7 


20 


15 


1, 6 


0.7, 10 


25 


26 


18 (6) 


12 


41 


37 


1, 6 


0.5, 10 


32 


31 


23(7) 


15 


49 


42 


1, 6 


0.3, 10 


43 


41 


27(8) 


17 


55 


51 


1, 10 


0.7, 10 


59 


58 


NA 


37 


69 


71 


1, 20 


0.98, 10 


128 


126 


NA 


117 


117 


142 


1, 6 


0.7, 80 


51 


54 


NA 


22 


121 


55 


2, 6 


0.98, 10 


4 


5 


3(6) 


4 


17 


8 


2, 6 


0.5, 10 


17 


18 


9 (7) 


8 


34 


29 



Various methods exist for explicitly resolving coalescence 
(e.g., | 37|4H ), but all suffer from similar exponential 
complexity to the exact marginal association calculation. 
Tractable heuristic approaches were proposed in the pre- 
prints 1 16 17 1; work to produce a tractable yet principled 
approach using variational inference is underway. 
CPHD, CB-MeMBer and MOM-MeMBer/P all exhibit 
good robustness to coalescence effects. They also react 
more quickly to cardinality changes due to the use of 
the cardinality mode estimator rather than component-by- 
component thresholding. Where coalescence is not domi- 
nant, all of these methods exhibit reduced performance in 
comparison to the various track oriented approaches. The 
reduction in performance larger in lower P^ cases. The 
MOM-MeMBer/P outperforms CPHD and CB-MeMBer 
throughout, especially in the low P'^ cases. The primary 
error in CPHD and CB-MeMBer in these cases is the 
so-called "spooky effect" 1 34|42[ , i.e., in cases in which 
one or more targets are not detected, their positions are 
estimated to be close to other targets that are detected, 
regardless of target spacing. Thus, as described in ||42), 
the global cardinality is estimated accurately but the local 
cardinality is inaccurate. Neither TOM-MeMBer/P nor 
MOM-MeMBer/P exhibit the same difficulty. 
With P"^ = 0.3, TOM-MeMBer/P, MOM-MeMBer/P and 
CB-MeMBer all maintain around 200 tracks on average, 
whereas JITS and LMITS maintain 20-30. This is the 
consequence of the track initiation logic necessitated by 
JITS and LMITS, and the higher track deletion threshold 
for JITS (which was necessary to successfully execute 
the algorithm); it directly leads to the degradation in 
performance of these methods compared to the others. 
Thus the apparent lower complexity of JITS in Table IT] is 
the result of its solving a much smaller problem, which 
leads to its lower performance, due to slow initiation. As 
shown in Table |I] a number of MC trials were unable to 
successfully complete for JITS due to a 32GB memory 
restriction, and the scenarios with n e {10,20} targets 
or A^^ = 80 could not be executed at all. The extension 



of the current paper in 1 23 1 demonstrates how the PPP 
component can be used to represent tracks with a low 
probability of existence to avoid maintaining such a large 
number of tracks. 
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Fig. 5. Peiformance of tracking methods on scenarios involving 
indistinguishable in case 1). Algorithms are encoded by colour as 
(dashed black). JITS is excluded from cases with n > 6 and A''' 
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Videos of a number of cases are included in the ancillary 
data on arXiv. 



VII. Conclusion 

In this paper, we have presented a derivation of a conjugate 
prior for a full Bayes RFS filter under common assump- 
tions, shown how the recently-developed LBP data association 
method can be applied to obtain a tractable approximation, 
and demonstrated its promising performance on a challenging 
problem. By comparing the resulting TOM-MeMBer/P algo- 
rithm with JITS/JIPDA, we effectively showed how variants 
of these popular methods can be derived using the RFS 
framework. The newly-proposed MOM-MeMBer/P algorithm 
follows the philosophy of the MeMBer alongside the LBP 
data association approximation to provide a solution that is 
robust to coalescence, and considerably improves upon the 
performance of CPHD and CB -MeMBer, especially with lower 
P"^ values. An existing extension of this work 023) demon- 
strates the significant computational benefit of representing 
Bernoulli components with a small probability of existence 
through the PPP component of the distribution. Our future 
work seeks to apply variational inference to tractably find 
a best-fitting multi-Bernoulli distribution, thus combining the 
superior performance of the TOM-MeMBer/P when targets are 
well-separated with the robustness against coalescence of the 
MOM-MeMBer/P 



Appendix A 
Proof of prediction step 

Proof of TheoremU\ The proof follows these steps: 



1) Substitute the p.g.fl form of the density (26i into the 



Bayes RFS prediction formula (15i 

2) Observe that the result is the product of two p.g.fls 

3) Show that the first component is a PPP, show that the 
intensity of the PPP is calculated using a standard PHD 
prediction step 1 1|20] 

4) Show that the second component is a MBM, and that 
the prediction step applies to each hypothesis of each 
track separately 

Substituting ([26]) into ( [T5| : 



Gt\t-i[h] (X explAi^i/i]} • G,_i|,_i[l - P' + P^pu] 
= expjA'^i/^]} • GPPP _J1 -P^ + P^ph]- 



GT%_,[l-P^ + P^Pu] 



(68) 



We assign the first two factors in this product to G^T^iVA 
and the final to G'/i^^l^l [/i] to obtain Gt\t-i[h] ^ G^^^\[h] ■ 
G'^^^^[h], where 

GPPP J/i] (X expjA^'i/i]} • GPPP ,_J1 -P^ + P'ph] (69) 

(70) 



G^^:^,[h]^G 



T-Mt-ii^ - P' + P'Ph] 



We will show that each of these retain their respective forms. 



i.e., (27 1 and (28 1. G'^F^-^lh] corresponds directly to the 



prediction step in the PHD |[T]: 

G'^^l.ih] (X explAi^i/i]} • GPPP ,_ Jl -P^+ P^ph] (71) 

explAi^H + Ar_i|,_ Jl -P^ + P^phW (72) 

exp{A'^H+Ar_i|,_jP^pJ} (73) 



ex ey. 
ex ey. 



= cM>H\t^im 



(74) 



where AJ^j_J/i] = A'^[/i] + Xi_^^_^[P''ph\- Thus G™i^[h] is 
indeed a PPP, with intensity given by ( [32| . 

We now consider G'^^^-y[h]; since G'^^'^^_-^[h] is a mix- 
ture of MeMBer distributions, this can be calculated via the 
MeMBer prediction equatioil'^ |3 , p675]: 



G1^^\[h] A GT^^,,,[1 ~P^ + P^p,] 









"" n G^tfiit-,[1-P^ + P^p,] 



i&Tt- 



"" n Gi^ii^ 



ieTt-i\t-i 



where 

i. 



G- [/.]^GJ- Jl-P^ + P^P.] 



(75) 



(76) 



(77) 



1 



+r 



1 



t-i\t-i // H^)ft\t~iix\x')dxP%x')fl-:^m_^{x')dx' 

(78) 



%-i+'^t]i-i I h{x)fl^^_^{x)dx 



(79) 



where the final equality results from the definitions of ( 35 i and 



(36|). Recognising ( |79] l as being Bernoulli, we obtain the result 
thi 

Ku 1 = ft! iu 1, ana will i = w. 



that G'^^^llh] is in the form (|28|), with nt\t-i = nt-x\t-\. 



<'t\t-i = K-i\t-v and w 



t\t~i 



t-i|t-i- 



Appendix B 
Proof of update step 

The structure of the proof of the Bayes update is as follows: 



1) Substitute the p.g.fl form of the density (26 1 into the 



Bayes RFS update formula (19i, where the joint p.g.fl 



of targets and measurements is given by p4] l 

2) Show that the result is a product of two p.g.fls, where 
the first is a PPP; this is proven in Lemma [T] 

3) Calculate the elements of the update of the second com- 
ponent (which we refer to as the remaining component) 

a) Updating the portion of the remaining component 
resulting from the PPP (Lemma |3]l 

b) Updating a Bernoulli component (Lemma pi 
The complete update of the remaining component can 
then be calculated from these elements using the product 
rule ( p5| l. 

4) By linearity of the derivative operator, the update in the 
remaining component can be broken into a sum with 
a term for each global hypothesis a in the prior MBM 
G^^!^^[h] (the first step in Theorem B 

'*With target survival/death but without target birth, as this is incorporated 
in the separate Poisson component C^F^ i ['^1 ■ 
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5) Given a prior global hypothesis (i.e., a global associ- 
ation history hypothesis involving measurements from 
previous time steps), the update can be calculated using 



the product rule (25 i; the resulting structure is shown in 
Lemma l5] to be in a form similar to (28 i with weights 
decomposing according to (pOb 
6) Subsequently (in Theorem |2pthe sum over prior asso- 
ciation hypotheses and hypotheses in the new scan are 
combined into a single sum 
Proof of LemmaU] Substituting (24i and (26 1 into (19 1, 

Gt\t[h] 

^ ' '"^''' l,\,^,[h{l-P^ + P%)]} 



The proof of Lemma |2] follows immediately from sub- 
stituting w, r and f{x) into the RHS of (87i. Lemma [s] 
calculates the unnormalised Bayes update for an unnormalised 
Poisson distribution in Poisson clutter (note that the undetected 
portion of the Poisson distribution has been removed as it is 
represented separately, in G^y^[h]). 

Lemma 3. 



5Z 



«^{exp{Af-[5]}-G, 



= n <i*(i 



3=0 



' t\t 



ry^^Ah]) (88) 



(80) 

ff=0 



where w^i^, r^ij and fiuix) are given in {50\, (51 \ and [52) 



5Zt 
■Gt-iH^ 



ff=0 

{ exp{Af-[5] + AJ^,_i[/^(l - P'' + P%)]}- 

9=0 



5Zt 



(81) 



(82) 



respectively (replacing z with zj). 

Proof: We use the derivative of a linear functional, and 
the chain rule: |3, p395] 



lm-m4jim-iFi9]-l^ny) 



(89) 



y=Flg] 



exp{AJl,_i[Ml-P')]}- 
^ -{eM^''^[9]+^t\t-l[hP%]}■ 






v^mbm 



P^ + pSs)]} 



(83) 



Using the result for linear functionals: 

= X'^{z) + X-^,_,[hP''f{z\-)] (90) 
Subsequently applying the chain rule: 
S 



5=0 



where ^0\ su bstitutes (|24lmto ([T9]l, ([81) substitutes (|26| for 
Gt|t_i[-], (82| substitutes (27|l for C^FJ^^h], and (|83|) exploits 
the fact thaH 



X^^,_,[ha-P'' + P%)] 
= fx'^^,_,{x)h{x){l-P''{x)+P''{x)pg{x)}dx (84) 

Xr^,_,{x)h{x){l~P\x)}dx 

K\t-ii^)Hx)P'^{x)Pg{x)dx (85) 

= AJ^,_i [/i(l -P'')]+ X^^,_,[hiP%)] (86) 

and that the factor exp{AJ^j_j^[/i(l — i^'^)]} is independent 
of the functional of differentiation g, and thus can be taken 
outside the derivative. Thus, by setting G^F^^ [h] to be the first 



exp{X%]+X^^,_,[hP^Pg]} 

= cxp{Af^[g]+AJ^,_i[/iP%]}- 

■{X'^z) + X-^,_,[hP^f{z\-)]) (91) 
Iterating the derivative, we find 
S 



6Z 



exp{X%]+X^^,_,[hP''p,]} 

^eM>^%]+K\t-i[hP%]}- 

■l[[x'^{z) + X'^^,_,[hP''f{z\-)]) (92) 
zez 

Setting g = 0, and applying Lemma |2] we obtain the desired 
result. ■ 

Lemma HI calculates the p.g.fl form of the Bayes update for 
an unnormalised Bernoulli distribution. 



line in (|83|l, and G™'^™[/i] to be equal to the second and third 



^t\t 



lines, we arrive at the results in ( (37] i, (|53|, (|54]l and ( |55| ). 

Lemma 2. Any p.g.fl of the following fornV^ is an unnor- 
malised Bernoulli distribution: 

a + b[h] = w{l — r + r ■ 

where w = a + J b{x)dx = a + b[l], r 

Hx) 
Ml]- 



ami 


I 4. 












5 
5Z 


i.ci' 


-.(^ 












^ ' t\t-iJ 


lifUm- 


pd 


+ p' 


%)]) 



m) 



(87) 



b[l 



Yj and f{x) 




9=0 
t\t \^~^t\t +^t|t Jt\t Wh ^-V-t\ 

0, otherwise 



(93) 



"with a > 0, b{x) > 0, < / b{x)dx < oo. If / b{x)dx = then any 
choice of f{x) will satisfy J87| . If a = and J b(x)dx = then any choice 
of r and f{x) will suffice. 



where w^" , r*|" and f^f^ {x) are given in [40), [41 i and [42 1 



it\t 



respectively, and w^j" , r^j" and /*|'" {x) are given in [44 \, 
(|45p and (|46|> respectively. 
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Proof: The case with Z = % resuhs simply from 
Lemma pi When Z = {zl}, we apply (89 1 and Lemma pi 



Lemma l5] shows that expressions of the form ( 25 1 can be 
expressed equivalently in a form similar to (28 i. 



Lemma 5. Suppose we are given set-parameterised function- 
als Ff[h], i e {0, ...,n}, such that F^[h] = Jlxez /o [^] "^"^ 
ifi>l and \Z\ > 2 then Ff[h] ^ 0. If Z = {z^ , . . . , z™}, 
then 



7i-]-7n 



E i[pr'[h]=j:^,n{-a)i[Frih] 



Wol±l---l±IH'„ = Zi=0 



where the sum over a spans all a — (a^, . . . , a"^™) such that 
a* € {0, . . . , m} V i, and 




ya^ A 



Ff.^M=\n\hl Z^{zi} (jG{l,...,m}) 

0, otherwise 

{0, 3i^i' s.t. a" = a"' ^ 0, 
or 3 j e {1, . . . , m} s.t. $ i s.t. a' = j 
1, otherwise 

Proof: We show that there is a one-to-one equivalence 
between non-zero terms in the LHS and RHS expressions. 
First, consider a non-zero LHS term involving (Wq, . . . , Wn)- 

For i e {1, . . . ,n} set a* = j if Wi = {z^, and a^ — 



if W, = 0. For j e {1, . 



, ,m 



}s.t. 



e VFo set a 



n+j _ 



J, 



Otherwise set a"+-' = 0. Since (Wq . . . , Wn) are disjoint and 
Wo U • • • U W„ = Z, we will have e„i(a) = 1, and the RHS 
term will be equivalent. By inspection, any change to a will 
result in a non-equivalent RHS term. Now consider a non-zero 



RHS term involving a = (a^, . 



~n-\-7n\ 



For i £ {1, . . . , n}. 



if a* > set W^ = {z^ }, and if a' = set W, = 0. Set 
Wq = U^i ^"^ ■ ^y assumption, em(a) — 1^ hence V j G 
{1, . . . , m} 3 i s.t. a' = j, so WqUIJILi W^* = ^- Also, $ i ^ 
i' s.t. ff = a^ ^ 0, so W^i are disjoint. Again, by inspection, 
any change to (Wi)"^o will result in a non-equivalent LHS 
term. ■ 

With these preliminary results, we are now ready to prove 
the main theorem. 

Proof of Theorem^ Following Lemma [T] the necessary 



result is that the G'^}'™ [h] of ( 55 i is made equal (or propor- 



t\t 



tional) to ( 28 1 through the equalmes made in the statement of 



the theorem (i.e., that G'^^"^[h] is a mixture of multi-Bernoulli 



distributions). Expanding the form of G",*!" [h] using (28 1 and 



( pOj l and applying the linearity property of differentials, we can 
write: 



Gt'^ih] (X 






et-i{a) 



6Zt 



n 



w. 



t\t- 






exp{A'-[5]+AJ],_i[/iP%]}. 



_^[h{l-P^ + P''p,)] 



i&Tti 



Considering a single term in this sum (i.e., a particular choice 



of a), we apply the product rule ( 25 i, taking derivatives with 
respect to the functional g, setting n — n^t_i, and 



Fo[g,h]^eM>^%]+X^,,_,[hP''pg]} 



(94) 



F,[g,h]=wlf_,G'^l[_,[h{l~P'' + P%)], z>0 (95) 

We then apply Lemma Is] with F^'[h] = ^7:^^4.9, ^]|g=o 
(observing through Lemmas [3] and |4] that the distributions have 
the forms required for Lemma l5]l to obtain: 



SZ, 



exp{A'^[.9]+AJl,_i[/iP%]}- 





9=0 


rit|t_i+mt 

= >;£-™.(«) n pf 

a i=l 


\h] (96) 



so that 



i+mt 



a.a i—1 

In order to combine the sums over historical hypotheses a 
and current scan hypotheses a into a single sum, define for 
i G Tt\t-i (i-e., pre-existing tracks), 

• K\t ^K\t-ii^ + '^t) 

. for a* — a* (i.e., a' = 0), define A^*(i,a*) = 

. for a* = a* + ft.J|j_ja*, a^ = j <E {l,...,mt}, define 

and for i — nt\t~i + j, j' G {1, . . . , rat] (i-e., new tracks), 

• a* G {1,2} where a^ — 1 corresponds to a* = 0, and 
a* = 2 corresponds to a' = j 

. MHi,l) = $, and M\t,2) = {{t,j)} 

Subsequently the sum over a and a is equivalent to a sin- 
gle sum over a (noting that, under the stated correspon- 
dences, ei_i(a)emj(a) — et{a)). Lemmas Is] and H show that 
Ff'^ [h] = wjjf (l-rjjf +rl^i^f[h]) where the respective 
parameters are given by (|40)l-(|42]l (missed detections on exist- 
ing tracks, i.e., i G {1, . . . ,nt\t-i}, a* G {1, . . . , ft.J|j_ J), 
(|44])-(46l (detections updating existing tracks, i.e., i G 
{1, . . . , nt\t-i}, a' G {/ij|f_i + 1, • ■ . , /tj| J), Q (new tracks 
without measurements, i.e., i G {nt\t-i + 1, . . . , nt\t~i +"T-t}, 
a* = 1), and (50i-(52i (updates of new tracks, i.e., i G 
{nt\t-i + 1, • ■ ■ , rtt|t-i + mt}, a' = 2). ■ 

Appendix C 
Implementation pseudo-code 

The simplest implementation of the algorithm uses a Gaussian 
approximation of each track, and a Gaussian mixture approx- 
imation of the PPP intensity of unknown targets. Pseudo-code 
for this is provided in this section for completeness; an earlier 
variant appeared in p6) (along with experimental results). 



Note that the experiments in Section VI utilised a Gaussian 
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mixture implementation, which is complicated by hypothesis 
management, etc. 

Additional assumptions include: 

• Probability of detection and survival are uniform, i.e., 

P^{x) =pd^ ps(^^^ ^ ps 

• Target-originated measurements follow a linear-Gaussian 
model, zt = Hxt + vt, where vt ^ M{vt\ 0, R} 

• Target dynamics follow a linear-Gaussian model, xt — 
Fxt-i + Wt, where wt ^ N{wt\Q^ Q} 

• Target birth intensity is 



A^a;,) = ^A''■^■AA{x*;x^'^P^•n 



fc=i 



The state maintained by the algorithm is: 
• A Gaussian mixture representation of the intensity of the 
PPP of unknown targets. 



Xt. 



t\t'\^t) - 2^ A^;,A/'{a;t;a::j"'j,,Pj|j, 



n 



fe=i 



• n^\ti tracks (Bernoulli components), each comprising of a 
probability of existence r\^,, a mean x\,^, and covariance 

The algorithm operates by executing the prediction step in 
Algorithm [T] the update step in Algorithm l2] LBP calculation 
of marginal association probabilities Algorithm [31 and then 
either the TOM-MeMBer/P procedure for forming new tracks. 
Algorithm |4] or the MOM-MeMBer/P procedure for forming 
new tracks. Algorithm l5] Steps for eliminating tracks with 
a low probability of existence and components in \'^,^,{x) 
with a low weight are not shown. Between update and re- 
forming of tracks, {rrit + 1) hypotheses exist for each pre- 
existing track i, parameterised by a hypothesis weight wjj", 

and hypothesis-conditioned probability of existence r*;", mean 

Xji", and covariance Pj]",- A single hypothesis exists for the 
new tracks, denoted by a = 1 (the non-existence hypothesis 



in ( 48 1 is accounted for without explicitly storing its weight. 



probability of existence, mean and covariance). 
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Algorithm 1: Prediction algorithm. 



input : nt-i\t-i, {r\_^^_^,x\_^\^_^,V\_^^^_^)\/ i e 
{l,...,nt_i|t_i}; <_i,t_i, 

f^u,k -u,k pu,fe w t. p 

y'^t-iit-i^ -^t-iit-i' ^ t-i\t-i) V «- 1 

input : nt\t-i, 

(''t|t-l' ^t|t-l' "t|t-l) V J € {1, 



, »^t|t-i}; 



"tit-v 

i \ u.k -u,k 

(A,u ,,x. 



pu,fc \ 
't|t-l'-^t|t-l' ^ t\t-l) 

1 Predict existing tracks 
3 for ie {l,...,nt|t_i} do 



Vfcejl, 



^t\t- 



-J 



' t\t-i 



P^'r. 



t-i|t-i 



"-t!*- 



t|t-i 



— Ft* 

1 ■ '- -^t-ilt-i 

-FPUilt-iF^ 



Q 



7 end 

8 Predict existing PPP intensity 



9 n" 



"tlt-i ■" '"t-ilt-i 
10 for fc e {1, . . . ,n" 



lido 

1 



t-i\t- 

xii.fe . ps \n,k 

'^tlt-l ■" ^ ^t-l|t 

^t\t-l ■~ ■'^*^t-l|t-l 

rtu.k Tppu,fc TpT 

^t|i-l •" -^^t-llt-l^ 



11 

12 
13 

14 end 

15 Incorporate birth intensity into PPP 

16 for fee {!,..., n*'} do 



Q 



17 
18 
19 



A 



t\t-l ^ ■- ^ 

^'^+^t-l|t-l 



b,k 

t-l\t- 

b,k 



20 end 



[j"'*''+"t-iit-i 
t\t-i 



"t-ilt-i 

■pb,k 
^t-l|t-l 
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Algorithm 2: Component update algorithm. 
input 



^nt^ 


-1' (^^t-i'2;t"'t_i,Pt]t_i) Vfc e 


{!,• 


■ , n'i\t-iy: "*!*-!' (^tV-i' ^*l*-i' P*|t-i) ^ * ^ 


{!,• 


•,rit|,_i}; Z, = {zi,...,z™'} 


«^t' 


(A;;f,xr|f,Pr|f)Vfce{l,...,n«,_J; 



'^t|t) l^t|t '''t|t '2: 



t|i,Ptp Vae {0, ...,TOt} for 



i G {1, . . . , n4|f_i}, and a e {1} for 
i G {nt|t_i,...,nt|t_i +mt} 

1 Update existing tracks 

2 for iG {l,...,ni|t_i} do 



6 

7 

g 

9 

10 

11 

12 

13 

14 

15 



Create missed detection hypothesis 

Ji°:=l-rj|,_,+rj|,_i(l-P<i) 
1,0 , 

t|t ■ 

i,0 

t|t •" -^tlt-l' -^ t\t ■~ "- t\t-\ 



' t\t-l "^ ' t\t 



i-'-;it-i+':;it-i(i-p") 

Create hypotheses with measurement updates 

S:=HPj|,_,H^ + R 

P+:=PJ|,_i"KHPj|,_ 
for j G {1, . . . ,r7if} do 



i/ 



w 



end 



-i.j ._ 









K-; P^i^ 



16 end 

17 Create a new track for each measurement by updating 
PPP with measurement 

18 for A: G {1, ... , n^^t^i} do 
H^ + R 



"tit- 



22 end 

23 for j G {1, . . . 

i ■= nt\t^i 
for k G {1, 

v := z^ 



f|t-i 
TOi} do 



K'=HP 



f|t-i 



J 



. , n 



t\t- 



-lido 



HS 



,,fc 
t|t-i 



c 



/i; -U.k 

y -^ ^tit-i 



end 

c :=. E£r^ c^ wi - 

i.l 

■r. 
P 



i,l -4,1 

'f|t ■— ^/ "^tit ^t|t ■— C 

-,1,1 

•~ c 



C-KA'^(zJ); 






^f|t 



fe=i 



f|t •— c ^k=i ^ L-' '^ y-^t\t V )(^tif y ) 

32 end 

33 Update (i.e., thin) intensity of unknown targets 

34 "^It = <|t-l 

35 for A: G {1, . 

36 ^"■'^ 



k\T^ 



. , n 



37 



t|t- 



lido 



A"f := (1 - P'1)A: 



a; 



38 end 



t|f 

u,k 
t\t 



X 



t|f- 



-1' -^tlt 



T' 



t|t-i 



Algoritlim 3: LBP algorithm for approximation of 
marginal association probabilities. 

input : Association weights u;*'" = w*;" V i G 



^t|t 



{l,...,nt|t„i},a G {0, 
false alarm/new target intensity 

it;"^J = ly^i^' V J G |1, . . . , ?nt|, 



^f|t 
convergence threshold e 

output: Approximate marginal probabilities p*(a) 

1 Run LBP iteration 

2 wliile max,; ,- |/i(,3-i„» - /i(,3-i„»l > e do 

3 



4 
5 

6 

7 

8 

9 

10 



for iG {l,...,7it|t„i} do 



s := 

Ma' 






E^l^'"'Mfc^->a' 



s-u'^'^Mft 



V j G {l,...,mt} 



end 

for J G {1, 



fJ-bJ- 



,mt} do 

v-~>"t|t-l 

V2G{1, 



^t| 



t-ll 



11 

12 end 

13 Calculate outputs-for existing tracks and then new tracks 

14 for iG {l,...,nt|t_i} do 
s := t.«^o + ^7' 



,,»,j. 



.iM' '■'A'fei- 



p*(0) :=w*'7s 



18 end 

19 for j G {1, . . . ,mt} do 

20 s := w'>'^ -h ^JjJl Ma-^6^ 

21 p"*i'-i+J(l) := wOj/s 

22 end 



,"!(} 



Algorithm 4: TOM-MeMBer/P algorithm for forming new 

tracks. 

input : Tracks updated using Algorithmic] marginal 

probability estimates p'(a). 
output: Re-formed, single hypothesis tracks, each 
comprising existence probability r!,,, state 



estimate x\^, and covariance P 



t|f' 



1 Form continuing tracks 

2 for j G {l,...,nt|t_i} do 

^i,j 
t|t 



t|f 



-?|t:=E.r=oP^O> 






pi _ 

^r^iiomr'^Kii + i 



t\t -^tif 



^tlf)(^: 



t|f 



''tlt^ 



6 end 

7 Fomi new tracks (already single hypothesis) 

8 for i G {nt\t-i + 1, . . . , nt\t} do 

9 



r!,. :=pH1)^;'; xlu := x!;h P!u := P!'' 



10 end 



' f|t 



't|t' -^tit 



"fit' -^ f|t 



t|f 
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Algorithm 5: MOM-MeMBer/P algorithm. 

input : Tracks updated using Algorithmic] marginal 
probability estimates p^{a). 
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comprising existence probability rj^, state 
estimate xj^, and covariance P*|j. 

1 Generate legacy (missed detection) tracks 
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4 end 
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